Learning State-Based Behaviour using Temporally Related Cases

نویسندگان

  • Michael W. Floyd
  • Babak Esfandiari
چکیده

Learning by observation allows a software agent to learn an expert’s behaviour, by examining the actions the expert performs in response to inputs, without the expert having to explicitly program the agent. Most learning by observation approaches only make use of the current inputs and actions of the expert and ignore any past inputs or actions. This limits the agents to only being able to learn reactive behaviour. We present an approach to case retrieval that uses the expert’s past inputs and actions in order to allow for learning state-based behaviour. We demonstrate our approach by learning from a simulated obstacle avoidance robot that reasons using internal state information. Our results show a significant accuracy improvement over retrieval that does not take into account any past information.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Designing a Model of Human Resources Training with Integrated Learning Approach in State Banking

Human resource studies seeks to investigate the factors affecting training and the impact of these trainings on the development of human learning in organizations, and in this field, it uses different methods. The complexity of the educational environment and others, the different dimensions of human resources training and the different conditions and influencing factors based on blended learni...

متن کامل

Tree Based Hierarchical Reinforcement Learning

In this thesis we investigate methods for speeding up automatic control algorithms. Specifically, we provide new abstraction techniques for Reinforcement Learning and Semi-Markov Decision Processes (SMDPs). We introduce the use of policies as temporally abstract actions. This is different from previous definitions of temporally abstract actions as we do not have termination criteria. We provide...

متن کامل

Georeferencing Semi-Structured Place-Based Web Resources Using Machine Learning

In recent years, the shared content on the web has had significant growth. A great part of these information are publicly available in the form of semi-strunctured data. Moreover, a significant amount of these information are related to place. Such types of information refer to a location on the earth, however, they do not contain any explicit coordinates. In this research, we tried to georefer...

متن کامل

Transient brain activity disentangles fMRI resting-state dynamics in terms of spatially and temporally overlapping networks

Dynamics of resting-state functional magnetic resonance imaging (fMRI) provide a new window onto the organizational principles of brain function. Using state-of-the-art signal processing techniques, we extract innovation-driven co-activation patterns (iCAPs) from resting-state fMRI. The iCAPs' maps are spatially overlapping and their sustained-activity signals temporally overlapping. Decomposin...

متن کامل

Artificial Intelligence Techniques for Misuse Detection in Telecommunications Environments Bournemouth University Software Systems Modelling

This report considers the application of Artificial Intelligence (AI) techniques to the problem of misuse detection within telecommunications environments. A broad survey of techniques is provided, that covers inter alia rule based systems, case based reasoning, pattern matching, clustering and feature extraction, artificial neural networks, genetic algorithms, artificial immune systems, agent ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011